Cosmos-Predict2.5 is a high-performance pre-trained world foundation model suite developed by NVIDIA specifically for physical AI. Based on diffusion model technology, it can generate high-quality images and videos with physical awareness based on text, image, or video input, providing world simulation capabilities for applications such as autonomous driving and robotics.
Multimodal
Diffusers